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Abstract. A regular set of words is (fc-)locally testable if mem- 
bership of a word in the set is determined by the nature of its sub- 
words of some bounded length k. In this article we study groups 
for which the set of all geodesic words with respect to some gener- 
ating set is (fe-)locally testable, and we call such groups (fc-)locally 
testable. We show that a group is 1-locally testable if and only 
if it is free abelian. We show that the class of (fc-)locally testable 
groups is closed under taking finite direct products. We show also 
that a locally testable group has finitely many conjugacy classes of 
torsion elements. 

Our work involved computer investigations of specific groups, 
for which purpose we implemented an algorithm in GAP to com- 
pute a finite state automaton with language equal to the set of 
all geodesies of a group (assuming that this language is regular), 
starting from a shortlex automatic structure. We provide a brief 
description of that algorithm. 



1. Introduction 

Let G be a finitely generated group and let X be a finite symmetric 
(that is, inverse-closed) generating set of G. Let Geo(G, X) be the set 
of all words over X that label geodesic paths in the corresponding Cay- 
ley graph for G. For many classes of groups, including word hyperbolic 
groups [21 Theorem 3.4.5], abelian groups, geometrically finite hyper- 
bolic groups [IB], Coxeter groups [ID] , and Garside groups [2], there 
are generating sets X for which the language Geo(G, X) is regular. 

Starting only from the assumption that Geo(G,X) is regular, we 
know very little about G. We do however have the following result, 
which applies in the more general situation when Geo(G, X) is recur- 
sive. 

Proposition 1.1. With the notation above, if Geo(G, X) is recursive 
and G has a recursively enumerable presentation, then G has solvable 
word problem. 
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Proof. Let to be a word over X. Since Geo(G, X) is recursive, we can 
decide whether or not w G Geo(G,X). If so, then w —a 1 if and only 
if w is the empty word. If not, then there exists a word v with v =g w 
and l(v) < l(w), and so v~ 1 w is a relator of G. But since G has a 
recursively enumerable presentation, we can enumerate the relators of 
G and hence find such a v. We now repeat the process with v in place 
of w. □ 



We are motivated by the question of whether there might be inter- 
esting subclasses of the class of regular languages for which we could 
say more about groups with Geo(G, X) in that subclass. In [7], we con- 
sidered groups for which Geo(G, X) is a star-free regular language (see 
also [T4"l Chapter 4, Definition 2.1] for the definition), and although we 
obtained some conditions under which groups have this property, we 
were not able to obtain any classifications. In [6], we considered groups 
satisfying the much more restrictive hypothesis that Geo(G, X) is lo- 
cally excluding for some symmetric generating set X of G, which means 
that there exists a finite set W of words over X with the property that 
a word w G Geo(G, X) if and only if w does not have a subword equal 
to a word in W. We proved that this is the case if and only if G is 
virtually free. 

In this paper, we consider groups G for which Geo(G, X) is a locally 
testable language for some X. The locally testable languages form a 
class lying between locally excluding and star-free regular languages. 

Informally, where A; is a positive integer, a set of words is /c-locally 
testable if membership of a word in the set depends on the nature of 
its subwords of length k. By a subword of a word aia 2 • • • a n , we mean 
either the empty word or a contiguous substring ajaj+i • ■ ■ dj for some 

1 < i < j < n. 

Examples of locally testable sets are given by the sets of all geodesies 
in both the free group on 2 generators and the free abelian group on 

2 generators, using standard generating sets. The set of geodesies in 
the free group on generators a, b is 2-locally testable, since a word 
in those generators is geodesic if and only if it contains no subword 
equal to any of aa" 1 , a _1 a, bb^ 1 or b~ 1 b. The set of geodesies in the 
free abelian group on generators a, b is 1-locally testable since a word in 
those generators is geodesic if and only if it contains neither both a and 
a^ 1 nor both b and Note that the first example is locally excluding 
but the second is not. We can show similarly that the geodesies of 
free groups and free abelian groups on any number of generators are 
respectively 2- and 1-locally testable. 
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A formal definition of a locally testable language is given in Section 
|2J We shall see that membership of a word w in such a language may 
also depend on the prefix and suffix of the word of length k — 1 as well 
as its subwords of length k. In other words, it depends on the subwords 
of length k in the word ou?o, where o is a symbol not lying in X. 

From a language theoretic point of view, the locally testable lan- 
guages have been well-studied. Various characterizations, provided by 
Brzozowksi and Simon, and by McNaughton, are described in Section 
® 

The local testability of Geo(G, X) will certainly depend on X in 
general. For example, the free group on four generators has Geo(G, X) 
locally excluding and hence locally testable when X is a set of free 
generators for G, but in [7] we exhibited a generating set of size 6 for 
which Geo(G, X) is not star-free and hence not locally testable. 

We shall say that the group G is fc-locally testable if Geo(G, X) is 
fc-locally testable for some generating set X of G, and G is locally 
testable if it is /c-locally testable for some k. 

The 2-generator Artin groups are defined by the presentations 

(a, b | (abf 2 = (ba) k ' 2 ) (k even), 

(a, b | (abY k - l)/2 a = (ba) (k - 1)/2 b) (k odd). 

It is proved in [11] that Geo(G, X) is regular with X = {a, b, a -1 , b^ 1 }. 
In fact it follows directly from Proposition 4.3 of that paper that 
Geo(G, X) is fc-locally testable, so this provides a further series of ex- 
amples of fc-locally testable groups. 

It can be shown that the kernel of the natural homomorphism of A n 
onto the dihedral group of order 2n in which the images of a and b 
have order 2 is a direct product of an infinite cyclic group and a free 
group of rank k — 1. We conjecture that any locally testable group is 
virtually a direct product of free groups. 

We are able to prove a complete characterization of 1-locally testable 
groups. Specifically, in Section [3j we show the following. 

Theorem 13.21 LetG be a finitely generated group. Then the following 
are equivalent. 

(1) G is free abelian. 

(2) G is 1-locally testable. 

(3) There is a finite symmetric generating set X for G such that 
the syntactic semigroup associated to Geo(G, X) is idempotent. 



In Section HJ we prove the following. 
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Theorem 14.11 For any k > the class of k-locally testable groups is 
closed under taking finite direct products. 

In fact, this is deduced as a corollary of the more general result: 

Theorem 14.21 Let C be a class of languages that is closed under the 
operations of union and taking inverse images under length preserving 
morphisms. Then the class of C-groups is closed under taking finite 
direct products. 

In Section we show that the local testability restriction on the 
geodesies results in an algebraic restriction on the group. 

Theorem 15.11 A locally testable group has finitely many conjugacy 
classes of torsion elements. 

Finally, in Section El we describe a method which, upon input of 
a finite presentation of a shortlex automatic group G with symmetric 
generating set X, will attempt to construct a finite state automaton 
with language equal to Geo(G, X). This process is guaranteed to suc- 
ceed eventually if Geo(G, X) is a regular language. This has been 
implemented using the GAP [5] interface to KBMAG 

Using our program to construct a minimal automaton for Geo(G, X) 
when this language is regular, we can then use conditions equivalent 
to those described in Section [2] to decide whether Geo(G, X) is also 
(fc-)locally testable. These conditions, as well as our results, are stated 
throughout this paper in terms of the syntactic semigroup of the regular 
language, but for computational purposes it is generally more efficient 
to work with the the transition semigroup of the minimal automaton 
M; these two semigroups are isomorphic (see for example [14"| p. 18]). 
It is shown in [IT] that local testability can be tested in polynomial time 
from the input M. We conclude the paper by listing some examples 
of the use of our program to test Geo(G, X) first for being regular and 
then for being (fc-)locally testable. 

2. Definition and characterizations of local testability 

Let X be any finite set. Let X* be the free monoid over X, that is 
the set of all strings over X, and let X + be the free semigroup, the set 
of all non-empty strings. 

Let k > be a natural number. For u G X* of length at least 
k, let pre fc (w) be the prefix of u of length k, let suffc(w) be the suffix 
of u of length k, and let subfc(w) be the set of all subwords of u of 
length k. If l{u) < k, then we define pre k (u) = u, suffc(-u) = u, and 
subfc(w) = 0. Two words u,v G X* satisfy u v if and only if 
pre fc _ 1 (-u) = pre fc _ 1 (t'), suffc_i(w) = suffc_i(t>), and subfc(w) = subfc(f). 
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A subset L C X* is denned to be k -locally testable [1, p. 247] if L is a 
union of equivalence classes of The set L is called locally testable 
if L is /c-locally testable for some k. 

Let LT denote the class of locally testable languages and LT(fc) the 
class of all fc-locally testable languages. From the definition as a union 
of equivalence classes, it follows immediately that LT(fc) is closed under 
the Boolean operations of union, intersection and complementation. It 
also follows from the definition that any fc-locally testable language is 
also m-locally testable for all m > k, and hence LT is also closed under 
the three Boolean operations. 

In Proposition 12.11 we list two characterizations of local testability 
due to Brzozowski and Simon, and McNaughton. 

From the first, it follows immediately that LT is strictly contained 
in the class of regular languages over X. (In fact it shows that locally 
testable languages are star-free.) 

The second characterization involves the syntactic semigroup of L, 
SS(-L), which is defined to be the quotient X + / where ~£ is the 
congruence defined as follows. We have u ~^ v if and only if, for 
all words s and t G X*, sut and svt are either both in L or both 
not in L. Note that the syntactic semigroup should be distinguished 
from the syntactic monoid, defined to be the quotient of X* by the 
same congruence. A semigroup S is said to be locally idempotent and 
commutative if, for every idempotent e G S, eSe is an idempotent 
commutative subsemigroup of S. 

Proposition 2.1. (1) [U Theorem 2.1(iv)] LT is the Boolean closure 
of the languages of the form X*wX* , wX* , and X*w, where w G X* . 
(2) [H Theorem 6.2], [121 Main Theorem, p. 63] L is locally testable 
if and only if its syntactic semigroup SS(L) is locally idempotent and 
commutative. 

Using Proposition 12.11 Brzozowski and Simon derived the following 
characterization of the locally testable languages. They stated it in 
terms of the associated minimal automata and transition semigroups, 
but we prefer to phrase it in terms of syntactic semigroups. 

Proposition 2.2. [U Theorem 6.2] A language L is k-locally testable 
if and only if its syntactic semigroup S satisfies the following: 
For all x,y,z G X* such that l(x) = k — 1, 

(1) xyxzx =s xzxyx, and 

(2) whenever xy = zx, then xy =s xy 2 . 

We apply this characterization in the case k = 1 in Section [3] below 
in order to characterize 1-locally testable groups. More generally this 
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characterization provides a test for fc-local testability that can be used 
algorithmically. 

3. 1-LOCALLY TESTABLE GROUPS 

Given a word w defined over X, we define the support of w, supp(w), 
to be the subset of X consisting of the symbols in X that appear in the 
word w. A set L of words over an alphabet X is 1-locally testable if 
membership of a word in L can be determined simply from examination 
of which symbols appear in the word, irrespective of where they appear; 
that is, for some finite set S of subsets of X, w is in L precisely if 
supp(w) G S. 

From Proposition 12. 21 we see that a language L is 1-locally testable if 
and only if the following two conditions hold in the syntactic semigroup 
S = SS(L), for all representatives in y, z G X*: (1) yz —s zy and 
(2) y =s y 2 '■ Note that there is no x in these conditions because the 
x of the conditions of Proposition 12.21 has length in this case. This 
gives us the following. 

Corollary 3.1. A language L is 1-locally testable if and only if its 
syntactic semigroup SS(L) is idempotent and commutative. 

Using this result we can characterize 1-locally testable groups. 

Theorem 3.2. Let G be a finitely generated group. Then the following 
are equivalent. 

(1) G is free abelian. 

(2) G is 1-locally testable. 

(3) There is a finite symmetric generating set X of G such that the 
syntactic semigroup of Geo(G, X) is idempotent. 

Proof. Suppose that G is finitely generated by Y = {yi, . . . y n } and let 
X = YUY~ 1 . 

If G is free abelian on Y, then a word w over X is geodesic in G if 
and only if its support is a subset of one of the 2™ distinct sets 

{yl\y?,...,yZ} 

defined by sequences (e 1; e 2 , . . . , e n ) G {1, — l} n . Hence G is 1-locally 
testable. 

Corollary 13.11 shows that if G is 1-locally testable, then (3) holds. 

Finally suppose that the syntactic semigroup S := SS(L) is idempo- 
tent, where L := Geo(G, X). For any word w, we have w =s w 2 , and 
so (by the definition of the congruence ~l) for all words u,v G X* we 
have uwv G L <^=^> uw 2 v G L. In particular, w m G L <^=^> w m+1 G L 
for all positive integers m. Hence if w is geodesic, then w m must also 
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be geodesic for all m > 0. This implies in particular that G must be 
torsion-free. 

Now let x G X be any generator, let g 6 G be any element, and 
let w be a geodesic word representing g~ x xg. Then, since w m must 
be a geodesic representative of g~ l x m g for all m > 1, we see that w 
must have length 1. So i has finitely many conjugates, and hence its 
centralizer has finite index. Therefore the center Z(G) has finite index 
in G. 

Next we apply [T5], Thm 10.1.4] to deduce that the derived subgroup 
G' must be finite. Since G is also torsion-free this implies that G' is 
trivial, and so G is abelian. Thus G is free abelian. □ 

We note that we have definitely made use of the group structure 
here. In general, an idempotent syntactic semigroup is not necessarily 
commutative. As an example consider the 2-locally testable language 
over {a, b} consisting of strings that do not start with b. Its syntactic 
semigroup S is idempotent but ab ^ s ba. However, it is a consequence 
of the above theorem that if the syntactic semigroup of the language 
of geodesies of a group is idempotent, then the group is commutative, 
and hence so is the syntactic semigroup. 

4. Closure under finite direct products 
In this section we prove the following theorem. 

Theorem 4.1. For any k > 0, the class of k-locally testable groups is 
closed under taking finite direct products. 

In fact the theorem is a special case of the following more general 
result. For a class of languages £, we understand an £-group to be 
a group G for which Geo(G, X) is in C, for some finite symmetric 
generating set X. 

Theorem 4.2. Let L be a class of languages that is closed under the 
operations of union and taking inverse images under length preserving 
morphisms. Then the class of C-groups is closed under taking finite 
direct products. 

Proof. Suppose that the geodesic languages Li and L 2 of the groups G 
and H over the symmetric generating sets X and Y are in C We adjoin 
generators to each of X and Y that are equal to the identity elements 
of the respective groups; since any words containing those generators 
are non-geodesic, this action does not change L\ or L 2 , and hence it 
does not affect their membership in L. 
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Define Z := {(x,y) : x G X, y G Y} C G x if. Then Z generates 
G x H (since X and Y each contain an identity). We define L to be 
the geodesic language for G x H over Z. 

For any word w over Z, define 7r 1 (w) to be the projection of w onto 
X* and 7i 2 (w) to be its projection onto Y*. Then, for w G Z*, we 
see that if w is to be non-geodesic, both of its projections must be 
non-geodesic, and conversely. In other words, the following holds. 

w G L -<=>- ni(w) G Li V 7r 2 (u>) G L 2 . 

Since 7Ti and 7r 2 are length preserving morphisms, the result now 
follows immediately. □ 

In order to deduce Theorem I4.ll from Theorem I4.2I we simply need 
the following lemma. 

Lemma 4.3. The class of k-locally testable languages is closed under 
the operations of union and taking inverse images under length pre- 
serving morphisms. 

Proof. Closure under union is immediate from the definition. The sec- 
ond claim follows from the fact that, if : A* — > B* is a length preserv- 
ing morphism and v, w G A* with v ~^ w, then <p{y) ~fc <p(w) (where 
~/t is as in the definition of local testability in Section [2]). □ 

Note that Theorem I4.2I can also be applied to the class of star-free 
languages. That result is also a special case of [7J Theorem 5.3], which 
is proved using a different construction, and different generating set. 

At this point it is natural to ask whether the class of groups with 
a locally testable language of all geodesies is closed under the free 
product operation. Although we do not have a definitive answer to 
that question, we suspect that the answer is no, and the following 
example shows that the natural generating set does not work. 

Consider the group G = Z 2 * Z = ((a) x (&)) * (c). Giving the Z 2 
subgroup the generating set from the proof of Theorem I4.2I consider 
the generating set 

(a- 1 ,l),(a- 1 ,6),(a- 1 ,r 1 ),c,c- 1 } 

for G. For every natural number n, the language Geo(G, X) of geodesies 
contains the word u = (1, b) n (a, b) n c n (a, b) n (a, b~ 1 ) n (a, b) n but does not 
contain the word v = (l,b) n (a,b) n (a,b~ 1 ) n (a,b) n c n (a,b) n . However, 
u ~ n v. So Geo(G, X) is not a union of ~ n equivalence classes for any 
n, and hence is not locally testable. 
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5. Locally testable geodesics and torsion conjugacy 



In this section we prove the following. 

Theorem 5.1. A locally testable group has finitely many conjugacy 
classes of torsion elements. 

First we have a useful lemma. 

Lemma 5.2. Let X be a set and let k > be a natural number. There 
exists a natural number N such that whenever w is a word over X of 
length at least N, then w 2 ~^ for all j > 2. Moreover, there exists 
a cyclic permutation w of w satisfying w ~^ v$ f or a ^ j > 1- 

Proof. Let N := 2k(\X\ 2k + 1), and suppose that w is a word with 
length l(w) > N. Since l(w) > k, for j > 2 we have pre^^w) = 
pre^O 2 ) = pre^fV), suf k -i(w) = suf fc _i(u; 2 ) = suf fc _i(V) and 
subfc(u> 2 ) = subkivji), as required. 

By our choice of N, we can write w = W1W2 ■ ■ ■ W\xm + ]W' , where 
each subword Wi has length 2k. There are only \X\ 2k distinct words 
over X of length 2k, so for some i ^ j, we have W{ = Wj. Hence we can 
write w = xuyuz for some word u of length 2k and x,y,z G X*. Also 
write u = st for words s, t G X* of length k each. Then w = xstystz. 

Next let w := tystzxs, a cyclic permutation of the word w. By 
analyzing prefixes, suffixes, and subwords of w and -uP as above, we 
find that w ~fc ?P for all j > 1. □ 

Now we prove Theorem 15.11 

Proof. Suppose that X is a finite symmetric generating set for G and 
that L := Geo(G,X) is fc-locally testable. Let N := 2A;(|X| 2fe + l) as in 
the proof of Lemma [5721 Suppose that w is a word of length l(w) > N, 
and that w represents a torsion element of G. 

Lemma 15.21 says that there is a cyclic permutation (and hence a 
conjugate) w of the word w such that w ~& w % for all i > 0. Then 
w G L <^=^ w l G L, so w is geodesic if and only if w 1 is. Since w 
represents a torsion element, w must also be torsion in G, and so for 
some i, and hence for all i, the word w l is not geodesic. Therefore 
w =g v for some word v satisfying l{y) < l(w) and representing an 
element of G in the conjugacy class of w. 

If l(v) > N, then repeat this argument to obtain words representing 
conjugates of w of successively strictly shorter length. Eventually this 
process must end with a word u of length l{u) < N such that u and w 
are in the same conjugacy class. Therefore there are only finitely many 
conjugacy classes of torsion elements. □ 



classes 
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6. Constructing a word acceptor for regular geodesics 

In this section, we describe a method for constructing a finite state 
automaton that accepts the set of all geodesic words in a finitely pre- 
sented shortlex automatic group, in the case that this set is a regular 
language and hence such an automaton exists. For Coxeter groups, a 
method for doing this is described in [10], which is considerably faster 
than the general approach presented here. 

We suppose that our group G is generated by the symmetric set X, 
and as usual let Geo(G, X) C X* be the set of all geodesic words. We 
also suppose that we have successfully computed the shortlex automatic 
structure of G with respect to some ordering of X. We shall assume 
throughout this section that X* is ordered by the associated shortlex 
ordering. 

The procedure to be described here, which attempts to construct 
a finite state automaton GW accepting a language L(GW) equal to 
Geo(G, X), will succeed eventually if Geo(G, X) is a regular language. 
If G is word-hyperbolic, then the method described in Section 3 of [1] 
will construct GW. Otherwise, we (repeatedly, if necessary) construct 
candidates for GW, and try to prove their correctness. We shall first 
discuss our method for proving correctness of GW, which will succeed 
if and only and GW really is correct (i.e. L(GW) = Geo(G, X)), and 
then discuss how to come up with suitable candidates. 

The automatic structure computed includes the shortlex word ac- 
ceptor W, the multiplier automata M x for i£lU {e} (which we shall 
not need), and also a word- difference automaton D with the following 
properties: 

(Dl) (u,v) + G L(D) =>- u —q v\ 

(D2) If u, v G X* with m,d6 L(W), x G X U {e} and ux = G v, then 
(ux,v)+ G L(D). 

Here (u,v) + denotes the padded pair corresponding to u, v G X*. In 
fact, the construction of D is such that its start state o is its only 
accepting state, and D contains transitions labeled (a, a) from o to a 
for all a G X . It therefore has the additional property: 

(D3) If {u,v) + G L(D) and w,w' G X* then (wu,wv) + G L(D) and, 
if l{u) = l(v), then (wuw' , wvw') G L(D). 

For a candidate GW for a finite state automaton with language 
Geo(G, X), we use standard operations on finite state automata, as 
described in Chapter 1 of [3] or Section 13.1 of [S], to check whether 
GW satisfies the hypotheses of the following theorem. If so, then the 
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theorem tells us that GW is correct (that is, L(GW) = Geo(G,X)). It 
is easy to see that these conditions are also necessary for the correctness 
of GW, so our procedure will succeed if and only if GW is correct. 

Theorem 6.1. Let GW be a finite state automaton over X which sat- 
isfies the following conditions: 

(i) L(GW) is prefix- closed; 

(ii) L(W) C L(GW); 

(iii) If {u,v) + G L(D) with l(u) = l(y), then u G L(GW) v G 
L(GW); 

(iv) If{u,v) + G L(D) with l(u) > l{v), then u L(GW). 
Then L(GW) = Geo(G,X). 

Proof. Suppose that Geo(G, X) % L(GW) and let w be the shortlex 
least element of Geo(G,X) \ L(GW). Then, by (ii), w £ L(W). Let 
w = w'uw", where u is of minimal length with u (jL L(W). Then, by 
(D2), we have {u,v) + G L(D) for some v G L(W). Since by (Dl) 
u =g v, and w G Geo(C7, X), we must have l(u) = l{v) and hence, by 
(D3), {w'uw" ,w'vw") G L(D). Moreover w'vw" G Geo(C7,X). Now, 
since v G L(W), we have u > v (recall that we are using the shortlex 
ordering on X*) and hence w = w'uw" > w'vw", and then w'vw" G 
L(GW) by choice of w. But this contradicts (iii). 

Suppose, on the other hand, that L(GW) $2 Geo(G, X) and let w be 
the shortlex least element of L(GW)\Geo(C7, X). Then w G" Geo(C7, X) 
implies w (jL L(W), and again we can write w = w'uw", where u is of 
minimal length with u G" L(W). Again we have (u,v) + G L(D) for 
some v G L(W). If l(u) > l(v) then, by (i) and the minimality of w, 
we must have w = w'u, and hence, by (D3), {w'u,w'v) + G L(D). But 
this contradicts (iv). On the other hand, if l{u) = l(v) then, by (D3), 
(w'uw" , w'vw") G L(D), which contradicts (iii) again. □ 

This gives the method of testing, for a candidate automaton GW, 
whether L(GW) = Geo(G,X). Now we turn to the problem of con- 
structing potential candidates for GW. 

In the methods to be described below, we can prove that if Geo(G, X) 
is regular and if we run the relevant programs for sufficiently long, 
then GW will be correctly calculated. Although we have no means 
of estimating for how long we actually need to run the programs, we 
can test a sequence of candidates for correctness, and thereby produce 
a terminating algorithm to compute GW with L(GW) = Geo(G,X), 
provided of course that Geo(G, X) is regular. 
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As mentioned previously, if G is word-hyperbolic, then the proce- 
dure described in Section 3 of [4] will construct the correct automa- 
ton GW. This method can be summarized as follows. Starting from 
the automaton GWo := W, the shortlex word acceptor in the auto- 
matic structure for G, we construct the automaton GWi which accepts 
all words v for which there exists a w G L(GWo) with l(w) = l(v) 
and with (v,w) in the language of the word-difference machine D. 
So L(GW ) C L(GWi) C Geo(G,X). We can construct GW, from 
GW,_i in the same way, for all % > 0. It can be shown that, if 
L(GWj) = L(GWi-i) for some i, then L(GWj) = Geo(G,X), and 
that this is the case if and only G is word-hyperbolic. 

If applied to a group that is not word-hyperbolic, then this proce- 
dure will not terminate, and will construct a sequence GW, (i G N) 
of automata with L(GW<) C Geo(G,X), L(GWj) C L(GW W ), and 
Geo(G, X) = U^ 1 L(GWj). So, for any k > 0, there exists an n such 
that, for all % > n, L(GWj) has the following property: 

(Pfc) the set of words of length at most k in L(GWj) is equal to the 
set of words of length at most k in Geo(G,X). 

We apply a method described by Trakhtenbrot and Barzdin in |16, 
Section IV. 2] to construct candidates GW for an automaton that sat- 
isfies L(GW) = Geo(G, X). This procedure, as presented in [16], as- 
sumes the existence of an automaton M, and takes as input a set of 
pairs (w, b), where w is a word and b is true or false, with b = true if 
and only if w G L(M). It attempts to construct M from this informa- 
tion. It is proved in [TBI Theorem 2.16] that, if M is known to have at 
most n states and the procedure is given the membership of all words 
of length at most 2n — 1 in L(M) as input, then it will successfully 
construct M. 

We shall apply a variant of this procedure using an automaton M' 
and an integer k > as input, and use the membership of words of 
length at most k in L(M') as our criterion to determine whether they 
lie in L(M). It will either abort or output a new automaton c(M', k); 
specifically, we apply it to GWi, for some i and k. 

It follows from [TBI Theorem 2.16] that, for any finite state automa- 
ton M, there exists an integer B(M) > 0, such that if k > B(M) and 
the sets of words of length at most k in L(M) and L(M') are equal, 
then the procedure applied to M' will output M as c(M',k). If M 
has n states, then this is true with B(M) = 2n — 1, but it is proved 
in [TBJ Theorem 5.10] that there is a constant C such that, for almost 
all automata M, B(M) < C log| A |(n), where A is the alphabet of M. 
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Unfortunately, we know no upper bound on the number of states of 
M, so we have no way of estimating B(M) in advance. However, since 
we know that, for any k, is satisfied by GWj for all sufficiently large 
i, the following is true. If Geo(G, X) is a regular language and if we 
calculate GWj for sufficiently large i and then apply the procedure for 
sufficiently large k, then an automaton GW with L(GW) = Geo(G,X) 
will be output as c(GWj, k). 

Here is a summary of our version of the Trakhtenbrot and Barzdin 
procedure. For a state a of a deterministic finite state automaton M 
with start state <r , we define the depth of a to be the length of a 
shortest word w with <Jq=<j (where is the state reached from <jq 
on reading the string w). For an integer k > 0, define M(a, k) to be the 
set of words w with l(w) < k such that a w is an accept state of M. For 
two states a, r of M, let d be maximum of the depths of a and r and, 
for k > d, define a and r to be k- equivalent if M(cr, k — d) = M(r, k — d). 
Note that, since the value of d depends on the pair of states a, r, k- 
equivalence is not necessarily an equivalence relation on the set of all 
states of M, although it will be for sufficiently large k. 

With input M' and k, the procedure attempts to define an automaton 
c(M', k) in which the set of states is the set of fc-equivalence classes of 
states of M', and the transitions are induced from those of M'. If k- 
equivalence turns out not to be a equivalence relation, or if transitions 
with a given label from /c-equivalent states do not lead to fc-equivalent 
states, then the procedure aborts. Otherwise it outputs c(M', k). 

Unfortunately, we do not know 5(GW) in advance, and neither do 
we know how large i must be to guarantee that GW, satisfies We 
have not yet implemented any heuristics for estimating these values; 
we have simply guessed at them. 

We finish with some examples in which we have successfully com- 
puted GW with L(GW) = Geo(G,X), together with the values of k 
and i used, and the number of states of GW. The first example is 
a 5-generator presentation of the wreath product of an infinite cyclic 
group with the group of order 2. There are presentations on fewer 
generators with regular sets of geodesies, but with this presentation 
Geo(G, X) also turns out to be 2-locally testable. The second, third 
and fourth examples are 2-generator Artin groups and they each have 
locally testable sets of geodesies. Indeed, we checked, using the condi- 
tions in Proposition [2721 t na t they are /c-locally testable for k — 3, 4 and 
5, respectively, a fact which we know to be true from [Til Proposition 
4.3]. 

The final example is a 4-generator Coxeter group for which Geo(G, X) 
is regular but not locally testable; in fact it is not even star-free. As 
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mentioned at the beginning of this section, it is more efficient to use 
the method described in [TO] to compute GW for this example. 

1. G = ( a, b, t, it, v | ab = ba, t 2 = 1, tat = b, at = u, bt = v); i = 3, 
k — 4, 10 states. 

2. G = (a,b | aba = bab); i = 3, k = 7, 28 states. 

3. G — ( a, b | abab = baba } ; i — 4, k — 10, 61 states. 

4. G = ( a, b | ababa = babab); i = 5, k = 12, 115 states. 

5. G = ( a, b, c, d \ a 2 = b 2 = c 2 = d 2 = {ab) 3 = {be) 3 = {cdf = {da) 3 = 
{ac) 2 = {bd) 2 = l); % = 6, k = 14, 125 states. 
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